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CLAIMS 

Having thus described our invention, what we claim as new and desire 
to secure by Letters Patent is as follows: 

f 

1 . An automated method for setting up an instance of a natural language 
conversational interface inla Web site comprising the steps of: 

defining a hierarchyi of topics into which individual documents or Web 
pages can be classified; \ 

/ generating a keyword index for those documents for an associated 
search engine; and ^ 

for each node in the hierarchy, specifying a mechanism for associating 
an input natural language (NL) query to the node. 



2. The automated method for setting up an instance of a natural language 
conversational interface in a Web site recited in claim 1, wherein the step of 
generating a keyword index comprises the step of extracting sparse n-grams of 
keywords for each group of pages\n the topic hierarchy. 



3. The automated method for setting up an instance of a natural language 
conversational interface in a Web si)e recited in claim 1, further comprising 
the step of optionally reviewing and editing the keyword index. 



4. An automated method for setting up an instance of a natural language 
interface in a Web site comprising the s)eps of: 

automatically inducing a classification hierarchy by examining a 
structure of the Web site; \ 

creating index terms for leaf pages from sparse n-grams; and 





creating rules for a classification engine from the sparse n-grams of 
pages reachable from each node in a hierarchy of leaf pages, wherein each 
node is a classification category and the rules associated with that category are 
used to decide if a new input document or query reference the node. 

5. The automated method for setting up an instance of a natural language 
interface in a Web site recited in claim 4, wherein the step of creating rules for 
a classification engine is performed automatically and fiirther comprising the 
optional step of manually editing me rules. 
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